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Abstract 

Packet erasure codes are today a real alternative to repli¬ 
cation in fault tolerant distributed storage systems. In 
this paper, we propose the Mojette erasure code based 
on the Mojette transform, a formerly tomographic tool. 
The performance of coding and decoding are compared 
to the Reed-Solomon code implementations of the two 
open-source reference libraries namely ISA-L and Jera- 
sure 2.0. Results clearly show better performances for 
our discrete geometric code compared to the classical al¬ 
gebraic approaches. A gain factor up to 2 is measured in 
comparison with the ISA-L Intel. Those very good per¬ 
formances allow to deploy Mojette erasure code for hot 
data distributed storage and I/O intensive applications. 

1 Introduction 

Storage systems rely on redundancy to face ineluctable 
data unavailability and component failures. For its sim¬ 
plicity, data replication is the de facto standard to pro¬ 
vide redundancy. For instance, three-way replication is 
the storage policy adopted by major file systems such 
as Hadoop Distributed File System im and Google File 
System [I]. While being straightforward to implement, 
plain replication typically incurs high storage overheads. 
It has now been acknowledged that erasure codes can sig¬ 
nificantly reduce the amount of redundancy compared to 
replication while offering the same data protection M- 
However, these storage savings come at a price in 
terms of additional complexity, as data must be encoded 
during write operations, and decoded during read oper¬ 
ations. Very efficient coding operations are thus keys to 
maintain transparent operations for I/O intensive applica¬ 
tions. Since data replication has higher storage costs but 
performs faster than erasure codes, storage systems tend 
to differentiate between cold data (i.e. not frequently ac¬ 
cessed, such as in long term storage) and hot data, typi¬ 
cally data that is frequently accessed. In practice, plain 


replication is used for I/O intensive applications due to 
fast data accesses while erasure codes are limited to long¬ 
term storage because of their extra complexity. 

Reed-Solomon (RS) are the most popular codes as 
they provide deterministic general-purpose codes with¬ 
out limit on the parity level. They are mostly imple¬ 
mented in their systematic form, meaning that the in¬ 
formation data is a part of the encoded data. In addi¬ 
tion, they are known to be Maximum Distance Separable 
(MDS) thus providing the optimal reliability for a given 
storage overhead. The former definition of RS codes are 
based on Vandermonde matrices and expensive Galois 
field operations. Implementing such codes in an effi¬ 
cient manner is therefore challenging. One of the best 
known implementation is provided by Jerasure ID, an 
open-source library that relies on Cauchy generating bit- 
matrices to only perform XOR operations, thus avoiding 
the costly multiplications. Recently, Intel released ISA- 
L, a performance-oriented open-source library ID that 
implements RS codes leveraging SIMD instructions. To 
the best of our knowledge, these two are the most effi¬ 
cient implementations publicly available. 

In this paper, we propose to use the Mojette trans¬ 
form El, a formerly tomographic tool, to implement a 
high-performance erasure code. The Mojette transform 
is a discrete and exact version of the Radon transform 
and relies on discrete geometry, contrary to the classic 
algebraic code definition. By nature, the Mojette trans¬ 
form provides a non-systematic erasure code. The geo¬ 
metric approach coupled with an optimized implemen¬ 
tation help to perform very fast encoding and decoding 
operations, handling I/O intensive applications such as 
virtualization or databases, that access small blocks of 
data (4 KB or 8 KB) in a random pattern |0. Those 
block sizes fit the general-purpose file systems require¬ 
ments such as ext4 or Btrfs. 
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Figure 1; Mqjette transform of a 3 x 3 image for direc¬ 
tions {p,q) in the set {(—1,1), (0,1), (1,1), (2,1)}. Ad¬ 
dition is done here modulo 6. 

2 The Mojette Erasure Code 

This section presents how the Mojette transform is used 
to encode data, the uniqueness conditions of the recon¬ 
struction solution and its inverse algorithm enabling the 
decoding. Finally, the end of this section details how the 
Mojette transform is used as an erasure code in practical 
systems. 

2.1 Forward Mojette Transform 

The forward Mojette transform is a linear operation that 
computes a set of ID projections at different angles, from 
a discrete image /; composed of PxQ pixels. 

A projection direction is defined by a couple of co-prime 
integers {p,q). Projections are vectors of variable sizes 
whose elements are called bins. A bin in the Mojette 
transform of / is characterised by its position b in the 
projection which corresponds to a discrete line of equa¬ 
tion b = —kq Alp. Its value is the sum of the centered 
pixels along the line; 

= Y^f{k,i)[b=-kq+ip], ( 1 ) 

k=Q 1=0 

where, [•] is the Iverson bracket ([P] = 1 whenever F is 
true, 0 otherwise). The number of bins B of a projection 
depends on the projection direction (p,q) and the lattice 
size PxQ-. 

B{p,q,P,Q) = \p\ (e- 1) + \q\ {P-l) + 1. (2) 

Figure gives an example of the forward Mo¬ 
jette transform for a 3 x 3 integer image. The pro¬ 
cess transforms the 2D image into a set of / = 4 
projections along the directions of the following set: 


{(—1,1), (0,1), (1,1), (2,1)}. For the sake of this exam¬ 
ple, addition is arbitrarily done modulo-6 (but any addi¬ 
tion works). The complexity ffiPQI) is linear with the 
number of projections and the number of grid elements. 
Note that some border bins are the exact copy of some 
pixels. This remark will help to understand how starts 
the inverse transform algorithm. 

2.2 Inverse Mojette Transform 

In this section, we first expose the reconstruction crite¬ 
rion on the projection set which yields to a unique recon¬ 
structed image. Then, we detail how is implemented the 
reconstruction algorithm. 

Reconstruction Criterion Katz has shown that for a 
PxQ lattice, the reconstruction is possible given a pro¬ 
jection set Si if one of the following criterion is veri¬ 
fied E): 

P <Y^\pi\oT^Q<Y^\qil (3) 

/=0 /=0 

where I is the number of projections involved in the re¬ 
construction process. 

In the example of the Figure [T] we see that each sub¬ 
set of 3 projections {(poj?o)j • • • i (P 2 j? 2 )} is such that 
L?=0 \^i\ Thus, the 4 projections in Figure [^de¬ 
picts a redundant representation of the image, where any 
3 projections among these 4 can be used for reconstruc¬ 
tion. 

Inverse Mojette Algorithm The reconsti'uction algo¬ 
rithm aims at finding a reconstructible bin and to write 
its value in the image by back-projection. Bins are re¬ 
constructible when they result from a unique pixel of the 
image. Once a bin is reconstructed, its contribution is 
subtracted from all the projections involved in the recon¬ 
struction, thus paving the way to reconstruct further bins. 
As the forward algorithm, the Mojette inverse is linear 
with the number of projections I and the number of ele¬ 
ments P xQ 'm the grid. 

Observing that the reconstruction propagates from the 
image corners to its center, Normand et al. m showed 
that given an image domain and a projection set, a de¬ 
pendency graph between the image pixels can be found. 
Within this graph, considering that a single projection is 
dedicated to the reconstruction of a single line of the im¬ 
age, a reconstruction path can be pre-determined. We 
refer the interested reader to due to lack of space. 

2.3 Properties of the Mojette Erasure Code 

The Mojette erasure code extends the application of the 
Mojette transform, originally designed for images, to any 
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type of data. As the Mojette transform creates a redun¬ 
dant representation, it appears to be an appealing candi¬ 
date to provide failure tolerance in storage systems. In 
classic coding theory words, we consider the k lines of 
the Mojette array as the input data packets, and we com¬ 
pute n projections as the set of encoded packets. The 
Mojette erasure code is therefore non-systematic here. 
Note that a systematic version of the Mojette is currently 
under development. Since the size of projection varies 
with the parameters {p,q) we consider for each projec¬ 
tion that qt = 1 to limit the bin overhead (as proposed 
in Q). Then the Katz criterion proves that if we get any 
k out of the n projections, it is possible to exactly recon¬ 
struct the array. This way, the storage system is able to 
face the unavailability of up to any n — k storage nodes. 

In practice, we can observe that some bins are never 
used during reconstruction whatever the projection set 
used for the process ca. Removing these bins from 
the encoding process, particularly when p increases, sig¬ 
nificantly limits the projection size variation and there¬ 
fore yields to a negligible storage overhead relative to 
the MDS case. 

3 Erasure Code Micro-benchmark 

In this section, we evaluate the performance of our new 
erasure code compared to the Jerasure and ISA-L li¬ 
braries. Firstly, we describe our Mojette implementation 
and then present our two competitors. Secondly, we de¬ 
tail the experiment setup to finally depict the results and 
analysis in the last section. 

Mojette We implemented a non-systematic version of 
the Mojette erasure code in C. In practice, pixel size 
should fit a computer word to improve performance 
based on XOR operations. Since x86 architectures pro¬ 
vide Streaming SIMD Extensions (SSE) instruction set, 
pixel and bin sizes are set to 128 bits to benefit from high- 
performance XOR computations. The Mojette encoding 
requires at most k—\ XORs per computed bin (and zero 
XOR for bins at projection edges). Similarly for decod¬ 
ing, at most k—\ XORs are required per reconstructible 
pixel. The progressive reconstruction from left to right 
of connected pixels (as proposed in ||6l), coupled with a 
drastic reduction of updates, guarantees spatial memory 
locality, thus high-performance computation. 

Jerasure The first competitor is the systematic Van¬ 
dermonde implementation of RS codes from the open- 
source Jerasure 2.0 library fO). We choose their Van¬ 
dermonde implementation since it performs better than 
their Cauchy-based implementation for such small pack¬ 
ets size. The Galois field size is set to w = 8 to fit our 
erasure code configuration {n,k). 


Intel ISA-L The second competitor is the RS im¬ 
plementation provided in Intel ISA-L open-source 
library i). It is one of the fastest systematic erasure 
code implementation since it makes intensive usage of 
the x86 architecture features such as xmm registers and 
SSE instructions. 

3.1 Experiment Setup 

We conducted all experiments on small data blocks of 
BlockSize equals to 4 KB and 8 KB (that fit block-based 
file-system like ext4). Eor encoding, we consider a sin¬ 
gle data block filled with random data. For decoding, we 
record the performance as we increase the number of era¬ 
sures up to the failure tolerance. With systematic codes 
(implementations of ISA-L and Jerasure), erasures only 
concern data packets (no decoding otherwise). 

Two erasure code configurations are considered for 
the benchmark: {n,k) equals (6,4) and (12,8), prevent¬ 
ing from 2 and 4 failures respectively. All the com¬ 
putations are performed in memory, with no disk I/O 
operation. Furthermore, we do not take into account 
pre-computations such as the matrix inversion or de¬ 
terministic reconstruction path respectively for the RS 
and Mojette implementations. Since we measure opti¬ 
mized encoding and decoding functions that are mostly 
computation-bounded, with the data entirely located in 
LI and L2 cache, we use the RDTSC instruction that re¬ 
turns the time stamp counter (TSC) which is incremented 
on every CPU clock cycle E). For all tested implemen¬ 
tations, the standard deviation is too negligible to be rep¬ 
resented (less than 1%). 

All the experiments are done on a single processor 
running Linux 3.2 and Debian Wheezy over an x86-64 
architecture. It embeds a 1.80 GHz Intel Xeon proces¬ 
sor, with 16 GB of RAM and cache sizes of 32, 256 and 
10240 KB for respectively LI, L2 and L3 cache levels. 

3.2 Results 

We now present the results of encoding and decoding 
throughput for various BlockSize and code parameters. 
For the sake of comparison, we plot the optimal perfor¬ 
mance recorded by the memcpy(). More precisely, the 
optimal encoding is given by the memcpyO of n pack- 
ets of bytes while the optimal decoding is the 

memcpyO of only k packets among the n encoded. Once 
again, note that the Mojette is implemented as a non- 
systematic code, thus increasing the overall computation 
compared to the two other systematic codes. 

Encoding Figure shows the encoding performance 
recorded for 4 KB (top) and 8 KB (bottom) data blocks 
for the (6,4) and (12,8) codes. The first observation is 
that the Mojette erasure code outperforms the two other 
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(6,4) encoding (12,8) encoding 

Figure 2; Encoding performance for an input data block 
of 4 KB (top) and 8 KB (bottom) depending on the code 
parameters {n = 6,k = 4) or (n = 12,k — 8). 

implementations in every tested settings. For example, to 
encode a 4 KB block with a (6,4) code, the Mojette im¬ 
plementation divides the number of CPU cycles by a fac¬ 
tor of 1.94 and 3.42 when respectively compared to ISA- 
L and Jerasure. While the Mojette implementation still 
provides the closest performance from the optimal value 
of the memcpyO, the improvements are mitigated for the 
code (12,8), and especially versus ISA-L for a block of 
8 KB. This is mainly due to our non-systematic design. 
Indeed, to encode a 8 KB block with a (12,8) code, the 
encoder splits 8 KB into 8 packets of one kilobyte and 
produces 4 encoded packets for systematic codes, while 
non-systematic codes have to compute 12 encoded pack¬ 
ets thus performing 3 times more computations. Finally, 
we notice that, as expected, the CPU cycles number lin¬ 
early increases with the BlockSize, as well as with the 
number of blocks to be encoded thus experimentally con¬ 
firming the linear complexity of the Mojette transform. 

Decoding We respectively plot in Figure ?? the num¬ 
ber of CPU cycles required to decode the data for the 
same codes as before, depending on the number of fail¬ 
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Figure 3: Decoding performance of a (n = 6,k = 4) code 
for an input data block of 4 KB (top) and 8 KB (bottom) 
depending on the number of failures. The dashed line 
depicts the optimal value of the memcpyO respectively 
measured at 336 (4 KB) and 603 (8 KB). 

ures. We still emphasize here the differences between 
systematic and non-systematic implementations. Since 
RS codes are systematic, when no failure occurs, they 
should achieve optimal performance (equivalent to a 
memcpyO) as the decoding process boils down to the 
copy of k data blocks in memory. For example, we see 
that ISA-F delivers the optimal performance for every 0- 
erasure settings. Note that our ongoing implementation 
of the Mojette in systematic-form would also provide the 
same results. 

We now focus on the results in the presence of fail¬ 
ures, when decoding operations are therefore involved 
(i.e. we do not just retrieve the data packets in memory). 
Results for the code (n = 6,k = 4) on a 4 KB block, de¬ 
picted on top of the Figure show that the number of 
CPU cycles is divided by a factor of 2.2 and 4.8 when 
respectively compared to ISA-F and Jerasure for a single 
failure. These factors are even higher when two erasures 
occurred. In fact, due to its non-systematic form, the 
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Results show that this paradigm shift towards a geomet¬ 
ric approach enables the Mo;e?fe-based implementation 
to significantly improve the throughput of coding and 
decoding operations. As non-systematic, the proposed 
code can still bring better throughputs in a foreseeable 
future. A Mojette erasure code implementation is cur¬ 
rently deployed in an open-source project RozoFS noi. 
We believe that this new code paves the way to the use 
of erasure codes in I/O intensive applications. 
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Eigure 4; Decoding performance of a (n = 12, k = 8) 
code for an input data block of 4 KB (top) and 8 KB 
(bottom) depending on the number of failures. The 
dashed line depicts the optimal value of the memcpyO 
respectively measured at 411 (4 KB) and 711 (8 KB). 

set of projections used has no influence on the decoding 
performances of the Mojette. On the contrary, the per¬ 
formances of Jerasure and ISA-L progressively decrease 
with the number of erasures. Although this performance 
gap is reduced for the code (n = 12, k — 8), especially 
versus ISA-L, results presented in Eigure still confirm 
the above observations. 

4 Conclusion 

Erasure codes are well known to incur a high compu¬ 
tational penalty due to their inherent coding operations, 
thus preventing them from being deployed in I/O inten¬ 
sive applications. In this paper, we advocated that the 
Mojette transform is a particularly suitable tool to design 
high-performance erasure code. We implemented and 
evaluated our new erasure code compared to the best- 
known implementations, namely ISA-L and Jerasure. 
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